Association between Modulation Spectrum and Speech Intelligibility of Syllable-timed Languages
Abstract
Previous studies showed that both the amplitude [1, 6] and the phase [4] of the Modulation Spectrum (MS) of speech waveforms play an important role in preserving intelligibility in stress-timed languages such as English. The current study investigates the association between the MS and the intelligibility of spoken sentences in Mandarin and Cantonese, which are typical syllable-timed languages [7, 8]. "Local time reversal" was applied to the waveform of each sentence. Speech identification accuracies were calculated and an MS analysis was carried out. Both the amplitude and the phase of the MS components at the syllabic rates of the spoken sentences were found to contribute to speech identification. We suggest that this work will help clarify the relation between speech intelligibility and speech acoustics, especially for syllable-timed languages.
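As a rough illustration of the two operations named in the abstract, the Python sketch below reverses a waveform within consecutive fixed-length segments and then computes the amplitude and phase of the modulation spectrum from the waveform envelope. The abstract does not state segment durations, the envelope extraction method, or any frequency bands, so the function names, the 50 ms segment length, the full-band Hilbert envelope, and the synthetic test tone are all assumptions made for illustration, not the authors' procedure.

```python
import numpy as np


def local_time_reversal(x, fs, segment_ms):
    """Reverse the waveform inside consecutive segments of segment_ms milliseconds.

    The segment length is a free parameter here (assumption); the last
    segment may be shorter than the others.
    """
    seg = max(1, int(round(fs * segment_ms / 1000.0)))
    y = np.array(x, dtype=float, copy=True)
    for start in range(0, len(y), seg):
        # Copy the reversed chunk so we never assign a view onto itself.
        y[start:start + seg] = y[start:start + seg][::-1].copy()
    return y


def analytic_envelope(x):
    """Amplitude envelope via the analytic signal (FFT-based Hilbert transform)."""
    n = len(x)
    spectrum = np.fft.fft(x)
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(spectrum * h))


def modulation_spectrum(x, fs):
    """Return (frequencies, amplitude, phase) of the envelope's spectrum.

    Full-band envelope with the mean removed (assumption); a per-band
    analysis would filter x into sub-bands first.
    """
    env = analytic_envelope(np.asarray(x, dtype=float))
    env = env - env.mean()
    spec = np.fft.rfft(env)
    freqs = np.fft.rfftfreq(len(env), d=1.0 / fs)
    return freqs, np.abs(spec), np.angle(spec)


if __name__ == "__main__":
    # Synthetic stand-in for speech: a 1 kHz carrier modulated at 4 Hz,
    # a value chosen only to resemble a syllabic rate (assumption).
    fs = 8000
    t = np.arange(2 * fs) / fs
    tone = (1 + 0.8 * np.cos(2 * np.pi * 4 * t)) * np.sin(2 * np.pi * 1000 * t)

    reversed_tone = local_time_reversal(tone, fs, segment_ms=50)
    for label, signal in (("original", tone), ("locally reversed", reversed_tone)):
        freqs, amp, phase = modulation_spectrum(signal, fs)
        peak = np.argmax(amp)
        print(label, "peak modulation frequency (Hz):", freqs[peak])
```

Comparing the amplitude and phase returned for the original and the locally reversed signal at the syllabic-rate bin is one simple way to see how a given segment length alters each component.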
Similar resources
What are the Essential Cues for Understanding Spoken Language?
Classical models of speech recognition assume that a detailed, short-term analysis of the acoustic signal is essential for accurately decoding the speech signal and that this decoding process is rooted in the phonetic segment. This paper presents an alternative view, one in which the time scales required to accurately describe and model spoken language are both shorter and longer than the phone...
Comparison of Auditory Perception and Speech Intelligibility Levels after Cochlear Implantation in Prelingual Patients with Hereditary and Non-hereditary Profound Hearing Loss Referred to Hazrat Rasoul Akram Hospital
Background & Aim: When inner ear is disturbed, both hearing sensitivity and selective property decrease. Early rehabilitation for proper progression of speech and language appropriate to age is mandatory. Several studies were performed to compare factors that affect the results of cochlear implantations to select the best candidates on the basis of different criteria. This study was underta...
On the Role of Theta-Driven Syllabic Parsing in Decoding Speech: Intelligibility of Speech with a Manipulated Modulation Spectrum
Recent hypotheses on the potential role of neuronal oscillations in speech perception propose that speech is processed on multi-scale temporal analysis windows formed by a cascade of neuronal oscillators locked to the input pseudo-rhythm. In particular, Ghitza (2011) proposed that the oscillators are in the theta, beta, and gamma frequency bands with the theta oscillator the master, tracking th...
The role of speech rate in perceiving speech rhythm
Human listeners can distinguish between languages of different rhythmic classes (e.g. stress- and syllable-timed languages). The present study investigated the role of speech rate in this process. Acoustic data suggest (experiment I) that speech rate can distinguish between stress- and syllable-timed languages as reliably as previously proposed correlates of speech rhythm (%V, VarcoC and nPVI). ...
Syllable Intelligibility for Temporally Filtered LPC Cepstral Trajectories
T. Arai et al., "Syllable intelligibility for filtered cepstral trajectories," JASA. Abstract: We measured the intelligibility of syllables whose cepstral trajectories were temporally filtered. The speech signals were transformed to their LPC cepstral coefficients, and these coefficients were passed through different filter...
Publication date: 2011